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Abstract 

The issue of treatiiient asslqnmeht is ordinarily dealt with within the 
framework of testing Aptitude Treatment Interaction hypotheses. ATI research 
mostly uses linear regression techniques, and an ATI exist when the AT 
regression lines cross each other within the relevant interval of the ap- 
titude variable. Consistent with this approach is the use of the points of 
interaction of AT regression lines as treatment-assignment rule, in this 
paper, it is proposed to replace such rules by monotone, nonrandomized 
(Bayes) rules. Both continuous and dichotomous criteria for treatment success 
are considered. An example of the latter can be found in individualized in- 
struction when learning success is evaluated using a mastery test. The 
solutions given in this paper are based on linear, normal ogive, and thresh- 
old utility functions. Finally, some modifications of these functions are 
discussed which are believed to be more realistic in the context of individ- 
ualized instruction but for which no optimal monotone assignment rules are 
available yet. 

Keywords: Aptitude Treatment Interaction, Decision Theory, Mastery testing. 
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statistical AsjDects of Optimal Treatment Assignment 

The problem we will consider in this paper is the following; SupjDOse 
we have a population of subjects and a number of different treatments to 
Which subjects of this jDdpulation are to be assigned. Furthermore ^ it is 
suppiosed that a criterion variable is available measuring the effect of 
the treatment* Final lyi there is a predictor variable which can be used 
for predicting the criterion scores Of the subjects for each of the treat- 
ments. The problem how consists of choosing a decision rule that assigns 
subjects to treatments on the basis of their predictor scores such that the 
assigrimeht procedure is optimal in some sense. 

An example of this problem can be found in individualized instructional 

systems. In these systems students are required to reach the same learning 

objectives but different instructional programs or treatments are available. 

Typically, the assignment of students to these treatments is based on their 

scores on an aptitude test administered previous to the instructional unit, 

while at the end of the unit a mastery test is administered to determine 

• .. _ ... 

whether the student has reached the learning objectives and may proceed with 

the next unit or has to receive additional instruction. Individualized in- 
struction has mainly been motivated by the view-point underlying Aptitude 
Treatment interaction (ATI) research (Bronbach & Snow, 1977), namely that 
subjects can react differently to treatments and that treatments which are 
best on the average may therefore be worst in individual cases. 

ether examples of situations to which the problem of this paper applies 
can be found, e.g., in psychotherapy, management sciences^ medicine^ and 
agricul tural sciences. 
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Let X arid Y deriote the predictor and criterion variable measared before 
and after the treatment* respectively. Since educational and psychological 
measurement iristrumerits are mostly tests, X and Y will be assumed to be 
integer-valued variables ranging from 0, m and 0, n, respectively. 

The possible treatments will be indexed by j = 0, t. Furthermore^ for 

each treatmerit d a probability function nj(x,y) will be adopted representing 
the relatidri between X and Y under treatment j. To select optimal decision 
rules, ari evaluation of the decision outcomes or utility function is needed. 
For the present paper it is sufficiently general to consider the utility, 
U say* as a function of the criterion Y, which is allowed to assume a 
different shape for each treatment: U = 

We shall first restrict the treatment-assignment problem to the case of 
two treatments. Moreover, it will be assumed that the optimal assignment 
rules we will be looking for can be found in the subclass of rules khbwri as 
monotone rules (Ferguson, 1967, sect. 6.1). For the case of two treatmerits 
this means that the optimal rule has the form of a cutting score b on the 
predictor X such that for predictor scores X < b one of the treatmerits and 
for X b the other treatment is assigned. The conditions which must be 
imposed on the utility arid probability furictibris to arrive at optimal rules 
of a monotone form are discussed in van der Liriden (1980a). 

For each possible monotone assignment rule, the expected utility is 
given by 

b-1 n m n 

(1) B(b) = I I UQ(y)no(x,y) + I I Oj(y)ni(x,y) . 

x=0 y=0 x=b y=0 
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We shall use (1) as our optimization criterion and look for cutting scores 
for which (1) is maximal. In doing so, we may use the important fact that, 
although the bivariate distributions n.(x,y) can be expected to assume a 
different shape for each treatment, this does not apply to the marginal 
distributions of the predictor scores inasmuch as these are measured pre- 
vious to the treatments. Thus, denoting the probability function of X by 
X(x), it holds that (1) may be optimized using the fact that 

(2) X^(x) = X(x) 

for all values of j. 

In this paper we will show some results for linear, normal ogives and 
threshold utility functions. No derivations will be given;; these can be 
found in van der Linden (1980a). The threshold utility fuhctidri seems real- 
istic when the criterion is a dichotomy, as is the case, for example ^ in 
individualized instruction when the criterion is a test used for mastery 
decisions (van der Linden, ig80b). We will also discUss other Utility 
functions suited for the case of a dichbtombus criterion ^ which are believed 
to be more realistic in the context bf individualized instruction than this 
threshold Utility function but fbr which he solutions are available yet. 

Befbre prbceeding^ however ^ we observe that^ although in part based on 
different assumpitibhs , the approach taken in this paper comes close to the 
one chbsen by Crbribach arid Gleser (1965, Appendix 1) in their model for 
placement decisions. Another approach to the present problem has been used 
by Vijn (1980). 
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Linear Utility 

For the ease of two treatments (t=l) with Utility and probability 
fdrietions obeying the conditions leading to sdlutidhs in the subclass of 
monotone rules, it holds that the optimal cutting scores b , on the predictor 
is equal to the smallest valUe of x for which 

(3) EiCU.(Y)!x] - EqCUq(Y)|x] 

is positive, this solutions which involves a simple comparison between two 

cbriditibhal expected utilities, follows in a few steps from (1) and (2). It 

should be noteds however, that (3) is no closed-form solution and that 

further restrictions are required to arrive at such solutions. 

We next suppose that the utility functions u.(Y) have a linear shape: 

•J 

The parameter g: can, for instance, be a nonpositive constant representing 
the costs of treatment j. The relation between utility and criterion score 
is also determined by the (treatment-dependent) parameter f ^ . 

If the regression functions of Y on X may be assumed to be linear, it 
follows from (3) that for utility function (4) the treatment assignments 
are optimal for that value of b equal to 

. / 9o - 91 + fd"o - 
(5) b* = entier ( ) + li 

o 
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where the entier function replaces the value of its argument by the greatest 
integer not greater than this value and a. and B. are the regression j3ara- 
meters given by E.(Yjx) = a. + g.x. 

Note that in applications the regression parameters in (5) must be 
estimated and that this solution can therefore be unstable when fgpQ - ^i^i' 



As an alternative to linear utility function (4), we next consider the 
following normal ogive utility function 



where i is the standard normal e.d.f. and the ( treatment-deperideht) para- 
meters p. and a: govern the location and slope of (6). The Use of cumulative 
distribution functions as utility function has been plead by Berhdld (1973)* 
bindley (1975), and Novick and Lindley (1978). An attractive feature of (6) 
is that it can readily be combined with the model of a normal distribution 
for Y given X = x. Assuming such a model along with linear regression 
functions of Y oh X and hcimdscedastici ty * it can be shown that the following 
optimal assignment rUle is obtained. 




(6) 




b* = 




(7) 



entier 



+ 1 
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with 




and 




where Cor:(X,Y) and Var.(Y) are the linear eorrelatibn coefficient between 
X and Y and the variance of Y under treatment j, respectively. 



Next, we suppose that criterion Y has a threshold value c so that Y 5 c 

means success and Y < c failure. An example of this arises in individualized 
instruction when the criterion is a mastery test with mastery score c. 

A utility function fitting this situation can be the following threshold 
utility function with parameter c 



Jiireshold Utility 



fw + a. 



for Y 



(8) 





for Y < c. 



ih which w and v rejDreseht the treatmerit-ihdependeht and a_. the treatment- 
dejDeriderit part (e.g.* treatment costs) of the utility structure. For utility 
function (8), it appears that the optimal treatment-assignment rule is the 



a 



smallest. value of x for which 



(9) 



S2q(c|x) - nj(c|x) + aQ - 



is positive, s;^. being the c.d.f. of Yjx. It also appears that when a^ = a^ 
and the conditional distributions of Y given x are normal with linear 
regression functions and hoinoscedasti city , the optimal assignment rule is 
given by 



(Id) 



b = entier 



(c - aj)CVarQ(Y.X)]^'^^ - (e - aQ)EVarj(Y.X)3 



1/2 



3lCVarQ(Y.X)]^'^^ - gQCVar^CY.X. )3 



i/2 



+ 1 



Note that this solution is a function not only of the regression parameters* 
a: and 3.. and the "unexplained" variances, Var.(Y.X), but also of the 
threshold value c. 

In all. three utility functions considered so far, utility is a function 
of the observed criterion scores. In the event of unreliable criterion scores 
or criterion scores that are inefficient estimators of an underlying* latent 
parameter, it seems better to revise the utility functions and to define 
them as functions of the true criterion score T. As has been indicated in 
van der Linden (1980a), thic f^oes not change the solutions given above for 
the linear and normal ogive utility function but has consequences for (10). 
Not only must c be replaced by the true threshold value d dh T but also 
Var.(Y.X) by Var^(T.X), while in the more general solution in (9) the c.d.f. 
of Y|x must be replaced by the c.d.f. of T|x. 
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jpeatment Assignment ahdjjastery Testing 

As noted earlier, an example of a treatment-assignment problem with 
a dichotomized criterion can be found in individualized instruction when 
learning success is evaluated by the administration of a mastery test at the 
end of the instructional unit. Mastery testing has also been approached from 
a decision-theoretic point of view (e.g. flambleton & Novick. 1973? for a 
review, see van der Linden. 1980b). In short, the mastery testing problem 
consists of two possible states obtained by dichotomizing the true-score 
scale* T, underlying the test by denoting a mastery score d. Students 
exceeding this score (T 3. d) are considered masters, the others (T < d) 
nonmasters. The problem is how to find a cutting score c on the observed- 
score variable. X. such that students are optimally classified as masters 
(X 5» c) and nonmasters (X < c). 

It is interesting to note that this problem fits the treatment- 
assignment problem with an unreliable, dichotomized criterion, which sug- 
gests a further integration of treatment assignment and mastery testing. A 
fruitful approach seems to adapt the utility functions in use for mastery 
testing to the fact that in individualized instruction mastery decisions 
are preceded by treatment-assignment decisions and to optimize both decisions 
simultaneously. 

Three examples of utility functions suited for this purp5se will be 
shown. The first example is 
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(11) 



"01 ^ ^d 



for T 5 d, Y < c 
for T < d, Y 5 c 
otherwise i 



which is the threshold utility function in use in mastery testing (Hamble- 
ton & Novick, 1973; Huynh, 1976; Mellehbergh, Koppelaar, and van der binden, 
1977) extended with a treatmeht-depehdeht parameter a. to represent, e.g., 
treatment costs. 

The next example is an adaptation of the linear utility function in- 
troduced in mastery testing to remove the unrealistic discontinuity at 
T = d in (11) (van der binden S Mellenbergh, 1976): 



(12) 



boj(d - T) . a,. 



i^b^-d - d) . a^. 



for Y < c 



for Y 5: c. 



The adaption is that the parameters a^^. , a^^. , b^^. , and b^^: have been made 
treatment dependent to be able to account for possible differences in 
utility and costs between treatments. 

The final example is an adaptation of the normal ogive utility function, 
which has been introduction in mastery testing by Novick and Lindley (1978) 
as ah illustration of the use of cumulative distribution functions for 
representing utility structures: 
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, - T 



Oj 



for Y < c 



Oj 



(13) 



+ a 



Ij 



for Y ^ c 



The parameters jjgj, y^j, Ogj, and a^^ have, just as in the preceding exampi 
df the linear utility function, been made treatment dependent. Besides, the 
parameters a^j and a^^^ have been added to allow for treatment costs. 

In principle, solutions for these three utility functions must be ob- 
tained by defining the expected utility using the trivariate distribution 
of (X,Y,T) and optimizing the resulting expression simultaneously to the 
cutting scores b on the predictor X and e on the criterion Y. Although 
it is believed that these can lead to an improvement on existing treatment- 
assignment and mastery decisions in individualized instruction, no closed- 
form solutions are available yet. This has to do with the fact that the 
optimization involved in this procedure is rather involved and that the 
conditions under which monotone solutions can be expected are not yet clear 



For a fuller discussion of the treatment-assignment problem in the 
first part of this jDaper^ a general Izatioh thereof to more than two treat- 
merits, a procedure for combining qualitative information with predictor 



Concluding Remark 
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scores to improve treatment assignment, and an "empirical" procedure to be 
used when the conditions leading to monotone rules are not met^ we refer 
to van der Linden (igSQb). 
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